feat: Add DeepSeek R1 support to Chutes provider (#4523) #4525

hannesrudolph · 2025-06-10T22:40:18Z

Description

This PR adds support for DeepSeek R1 models when using the Chutes provider. Previously, when using DeepSeek R1 models via Chutes, the reasoning format wasn't recognized, causing reasoning blocks to be merged with regular content and degrading model performance.

Changes Made

Modified BaseOpenAiCompatibleProvider to expose the client property as protected instead of private, allowing subclasses to access the OpenAI client
Enhanced ChutesHandler to:
- Detect DeepSeek R1 models by checking if the model ID starts with "deepseek-ai/DeepSeek-R1"
- Parse reasoning chunks separately by handling delta.reasoning in the stream
- Apply R1 format conversion for message formatting
- Set appropriate temperature (0.6) for DeepSeek models
Migrated tests from Jest to Vitest format and added comprehensive tests for DeepSeek R1 functionality

Testing

All existing tests pass
Added tests for DeepSeek R1 reasoning format handling
Added tests for temperature settings
Manual testing completed:
- Verified reasoning chunks are parsed separately
- Confirmed R1 format is applied correctly

Verification of Acceptance Criteria

DeepSeek R1 models are properly detected when used via Chutes
Reasoning chunks are parsed separately and not merged with regular content
The R1 format is correctly applied for message formatting
Appropriate temperature settings are used for DeepSeek models

Checklist

Important

Adds support for DeepSeek R1 models in Chutes provider, handling reasoning formats and temperature settings, with tests migrated to Vitest.

Behavior:
- ChutesHandler now detects DeepSeek R1 models by checking if model ID starts with "deepseek-ai/DeepSeek-R1".
- Parses reasoning chunks separately using delta.reasoning in the stream.
- Applies R1 format conversion for message formatting.
- Sets temperature to 0.6 for DeepSeek models.
Code Changes:
- BaseOpenAiCompatibleProvider: client property changed from private to protected.
- ChutesHandler: Implements createMessage() to handle DeepSeek R1 models with <think> tags.
- getModel() in ChutesHandler adjusts temperature for DeepSeek R1 models.
Testing:
- Migrated tests from Jest to Vitest.
- Added tests for DeepSeek R1 reasoning format and temperature settings in chutes.spec.ts.

^{This description was created by}^{for 9810152e6065c25dd3556866edb981515f7b9c3d. You can customize this summary. It will automatically update as commits are pushed.}

src/api/providers/chutes.ts

daniel-lxs

It seems like the tests are being migrated to use .spec due to a monorepo.md rule.

It should also handle the Deepseek Chimera model.

LGTM

mrubens · 2025-06-12T03:25:29Z

Looks like some test conflicts unfortunately. Do you mind updating and I'll take another look at approve once it's done?

- Modified BaseOpenAiCompatibleProvider to expose client as protected - Enhanced ChutesHandler to detect DeepSeek R1 models and parse reasoning chunks - Applied R1 format conversion for message formatting - Set appropriate temperature (0.6) for DeepSeek models - Migrated tests from Jest to Vitest format - Added comprehensive tests for DeepSeek R1 functionality This ensures reasoning chunks are properly separated from regular content when using DeepSeek R1 models via Chutes provider.

… provider

…essage method

daniel-lxs · 2025-06-12T04:21:58Z

@mrubens
The conflicts are solved

mrubens · 2025-06-12T15:37:05Z

src/api/providers/base-openai-compatible-provider.ts

 	protected readonly options: ApiHandlerOptions

-	private client: OpenAI
+	protected client: OpenAI


Is this one necessary to change?

Never mind, I see now

* feat: Add DeepSeek R1 support to Chutes provider (#4523) - Modified BaseOpenAiCompatibleProvider to expose client as protected - Enhanced ChutesHandler to detect DeepSeek R1 models and parse reasoning chunks - Applied R1 format conversion for message formatting - Set appropriate temperature (0.6) for DeepSeek models - Migrated tests from Jest to Vitest format - Added comprehensive tests for DeepSeek R1 functionality This ensures reasoning chunks are properly separated from regular content when using DeepSeek R1 models via Chutes provider. * feat: Enhance DeepSeek R1 support with <think> tag handling in Chutes provider * fix: Correct temperature retrieval in ChutesHandler to use model's info * fix: Update condition for DeepSeek-R1 model identification in createMessage method --------- Co-authored-by: Daniel Riccio <[email protected]>

hannesrudolph requested review from cte, jr and mrubens as code owners June 10, 2025 22:40

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Jun 10, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Jun 10, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Jun 10, 2025

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. enhancement New feature or request labels Jun 10, 2025

ellipsis-dev bot reviewed Jun 10, 2025

View reviewed changes

src/api/providers/chutes.ts Outdated Show resolved Hide resolved

daniel-lxs approved these changes Jun 10, 2025

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Jun 10, 2025

daniel-lxs moved this from Triage to PR [Needs Review] in Roo Code Roadmap Jun 10, 2025

hannesrudolph added the PR - Needs Review label Jun 11, 2025

hannesrudolph and others added 4 commits June 11, 2025 23:01

feat: Enhance DeepSeek R1 support with <think> tag handling in Chutes…

6c85bdd

… provider

fix: Correct temperature retrieval in ChutesHandler to use model's info

f1b7d7d

fix: Update condition for DeepSeek-R1 model identification in createM…

4228745

…essage method

daniel-lxs force-pushed the 4523-2 branch from 9810152 to 4228745 Compare June 12, 2025 04:03

mrubens reviewed Jun 12, 2025

View reviewed changes

mrubens approved these changes Jun 12, 2025

View reviewed changes

mrubens merged commit a851ffb into main Jun 12, 2025
12 checks passed

mrubens deleted the 4523-2 branch June 12, 2025 15:39

github-project-automation bot moved this from New to Done in Roo Code Roadmap Jun 12, 2025

github-project-automation bot moved this from PR [Needs Review] to Done in Roo Code Roadmap Jun 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add DeepSeek R1 support to Chutes provider (#4523) #4525

feat: Add DeepSeek R1 support to Chutes provider (#4523) #4525

Uh oh!

hannesrudolph commented Jun 10, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

Uh oh!

daniel-lxs left a comment •

edited

Loading

Uh oh!

mrubens commented Jun 12, 2025

Uh oh!

daniel-lxs commented Jun 12, 2025

Uh oh!

mrubens Jun 12, 2025

Uh oh!

mrubens Jun 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: Add DeepSeek R1 support to Chutes provider (#4523) #4525

feat: Add DeepSeek R1 support to Chutes provider (#4523) #4525

Uh oh!

Conversation

hannesrudolph commented Jun 10, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes Made

Testing

Verification of Acceptance Criteria

Checklist

Uh oh!

Uh oh!

daniel-lxs left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mrubens commented Jun 12, 2025

Uh oh!

daniel-lxs commented Jun 12, 2025

Uh oh!

mrubens Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

mrubens Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

hannesrudolph commented Jun 10, 2025 •

edited by ellipsis-dev bot

Loading

daniel-lxs left a comment •

edited

Loading